# Sliding Window Attention
## h2oai/h2o-danube-1.8b-base
Apache-2.0 · Large Language Model · Transformers, English · 281 downloads · 43 likes

A 1.8B-parameter base language model trained by H2O.ai, built on an adjusted Llama 2 architecture with support for a 16K context length.
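A minimal sketch of prompting the base model through the transformers library; it assumes `transformers` and `torch` are installed and that the checkpoint can be downloaded from the Hub.

```python
# Minimal sketch: plain text generation with a base (non-instruct) checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "h2oai/h2o-danube-1.8b-base"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# Base models continue a prompt rather than follow instructions.
inputs = tokenizer("Sliding window attention lets a model", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=40, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```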
## mistralai/Mistral-7B-Instruct-v0.1
Apache-2.0 · Large Language Model · Transformers · 468.63k downloads · 1,659 likes

Mistral-7B-Instruct-v0.1 is a version of the Mistral-7B-v0.1 generative text model fine-tuned on a variety of public dialogue datasets for instruction following.
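A minimal sketch of instruction-following inference using the tokenizer's built-in chat template; it assumes `transformers` and `torch`, plus roughly 15 GB of memory for the bf16 weights.

```python
# Minimal sketch: chat-style inference with an instruct-tuned checkpoint.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-7B-Instruct-v0.1"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

# The chat template wraps the message in Mistral's [INST] ... [/INST] format.
messages = [{"role": "user", "content": "Explain sliding window attention in one sentence."}]
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")

output = model.generate(input_ids, max_new_tokens=64, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```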
## shi-labs/nat-base-in1k-224
MIT · Image Classification · Transformers, Other · 6 downloads · 0 likes

NAT-Base is a vision transformer trained on ImageNet-1K that uses the neighborhood attention mechanism for image classification.
## shi-labs/nat-small-in1k-224
MIT · Image Classification · Transformers, Other · 6 downloads · 0 likes

NAT-Small is a hierarchical vision transformer based on neighborhood attention, designed for image classification tasks.
## shi-labs/nat-mini-in1k-224
MIT · Image Classification · Transformers, Other · 109 downloads · 0 likes

NAT-Mini is a lightweight vision transformer based on the neighborhood attention mechanism, designed for ImageNet image classification.
## shi-labs/dinat-mini-in1k-224
MIT · Image Classification · Transformers · 462 downloads · 1 like

DiNAT-Mini is a hierarchical vision transformer based on the dilated neighborhood attention mechanism, designed for image classification tasks.
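The NAT and DiNAT checkpoints above share the same image-classification interface in transformers. A minimal sketch follows, using dinat-mini as the example; note that the neighborhood-attention models in transformers depend on the separate `natten` package, and the image URL is just a sample input.

```python
# Minimal sketch: ImageNet-1K classification with a neighborhood-attention model.
# Requires: pip install transformers natten pillow requests torch
import requests
from PIL import Image
from transformers import AutoImageProcessor, AutoModelForImageClassification

model_id = "shi-labs/dinat-mini-in1k-224"  # any of the NAT/DiNAT checkpoints above works
processor = AutoImageProcessor.from_pretrained(model_id)
model = AutoModelForImageClassification.from_pretrained(model_id)

url = "http://images.cocodataset.org/val2017/000000039769.jpg"  # sample image
image = Image.open(requests.get(url, stream=True).raw)

# Preprocess to 224x224, run the forward pass, and report the top class.
inputs = processor(images=image, return_tensors="pt")
logits = model(**inputs).logits
print(model.config.id2label[logits.argmax(-1).item()])
```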
## surajjoshi/swin-tiny-patch4-window7-224-finetuned-braintumordata
Apache-2.0 · Image Classification · Transformers · 11 downloads · 1 like

A vision model based on the Swin Transformer architecture, fine-tuned specifically for brain tumor image classification.
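A minimal sketch using the high-level `pipeline` API instead of the explicit processor/model pair shown above; the repo id here is reconstructed from the listing and may differ in casing, and the image path is a placeholder.

```python
# Minimal sketch: one-call inference with a fine-tuned classifier.
from transformers import pipeline

# Repo id reconstructed from the listing above; verify the exact casing on the Hub.
classifier = pipeline(
    "image-classification",
    model="surajjoshi/swin-tiny-patch4-window7-224-finetuned-braintumordata",
)
print(classifier("path/to/brain_mri.jpg"))  # replace with a real image path
```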
## mrm8488/longformer-base-4096-spanish
MIT · Large Language Model · Transformers, Spanish · 22 downloads · 16 likes

A Spanish long-document model built from a RoBERTa checkpoint, supporting sequence lengths of up to 4096 tokens.
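A minimal sketch of encoding a long Spanish document; Longformer accepts an optional `global_attention_mask` alongside its sliding-window local attention, and marking the first token for global attention is the usual convention. The repeated sentence is just filler input.

```python
# Minimal sketch: long-document encoding with local + global attention.
import torch
from transformers import AutoModel, AutoTokenizer

model_id = "mrm8488/longformer-base-4096-spanish"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModel.from_pretrained(model_id)

# Filler text standing in for a long document, truncated at 4096 tokens.
text = " ".join(["Este es un fragmento de un documento largo."] * 300)
inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=4096)

# All tokens get sliding-window local attention; give the first token global attention.
global_attention_mask = torch.zeros_like(inputs["input_ids"])
global_attention_mask[:, 0] = 1

outputs = model(**inputs, global_attention_mask=global_attention_mask)
print(outputs.last_hidden_state.shape)
```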